(54) Method for molecular indexing categorising of expressed genes using restriction enzymes



(57) This invention relates to a method for classifying (indexing) cDNA which has been
reverse-transcribed from tissue- or cell-derived RNA, or DNA, in a short period and
without duplication by using class-IIS restriction enzymes or a combination of a
class-IIS and a class-II restriction enzyme. According to this invention, it is possible
to analyse and diagnose variations such as tumors easily, correctly and promptly by
comparing the analyzed pattern of genes expressed in a cell or tissue sample with the
analyzed pattern of normal genes. This method is also applicable to the search and
isolation of genes of physiologically active substances that are potential
pharmaceuticals or causative genes of hereditary diseases, as well as the isolation of
those genes that are useful for improving agricultural products.
Description

FIELD OF THE INVENTION 

This invention relates to a method for molecular indexing which is applicable to the analysis and diagnosis of
diseases such as cancers, the search and isolation of genes of physiologically active substances that are potential
pharmaceuticals or causative genes of hereditary diseases, as well as the isolation of those genes that are useful for
improving agricultural products.

BACKGROUND OF THE INVENTION

For examining differences in gene expression between two tissues, a method has been described wherein a
portion (about 50-200 genes) of the expressed gene population is amplified by PCR (the polymerase chain reaction)
using arbitrary short primers and then separated by polyacrylamide gel electrophoresis [P. Liang and A. B. Pardee,
Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction, Science 257: 967-971
(1992)]. However, in such differential display by PCR, only a portion of the whole gene population is amplified
in principle, and yet a plurality of bands are generated from the same gene. Furthermore, such display involves a large
quantity of artifacts and thus is technically incomplete. Therefore, such display only shows differences in gene
expression between two tissues which are not remote from each other, or differences in gene expression in cells.
Such differential display has the problem that it cannot record the expression of individual genes.

It is also possible to analyze variations in tissues or cells by determining the level of a particular gene in such
tissues or cells through measuring the amount of its mRNA by the Northern blot hybridization method. However, this method
is not applicable when the target gene is not cloned or its base sequence is unknown. In addition, this method
is not suitable for the analysis of a large number of genes. For example, since about 10,000 species of genes are
considered to be expressed in a given cell, it would take about two years even if Northern blot hybridization were
performed for 100 genes per week. Thus, this method is not practically useful.
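The two-year estimate follows directly from the figures quoted above. A minimal check, using only the numbers given in the text (the variable names are illustrative):

```python
import math

GENES_EXPRESSED = 10_000   # approximate number of expressed gene species per cell (from the text)
BLOTS_PER_WEEK = 100       # Northern blot throughput assumed in the text

# Weeks needed to probe every expressed gene once
weeks = math.ceil(GENES_EXPRESSED / BLOTS_PER_WEEK)
years = weeks / 52

print(f"{weeks} weeks, about {years:.1f} years")
```

At 100 genes per week the survey takes 100 weeks, which is just under two years.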

On the other hand, those restriction enzymes which belong to class IIS (hereinafter referred to as "class-IIS
restriction enzymes") are restriction enzymes that cut at a precise distance outside their recognition sites.
Fragments cut by a class-IIS restriction enzyme characteristically have non-identical cohesive ends consisting
of several nucleotides. More than 30 class-IIS restriction enzymes are known, including Fok I, Bsm FI,
Bsm AI, Bbv I, Sfa NI and Hga I. It is estimated that genes which have at least one cleavage site of Fok I, Bsm FI or
Bsm AI account for 97% of total genes. Brenner et al. have introduced a method of preparing a more detailed genome map
using a class-IIS restriction enzyme which generates 4-nt (nucleotide) overhangs in place of conventional restriction
enzymes [S. Brenner and K. J. Livak, DNA fingerprinting by sampled sequencing, Proc. Natl. Acad. Sci. U.S.A.,
86:8902-6 (1989)]. There is also disclosed a method wherein a part of the restriction enzyme fragments derived from a
phage or cosmid is amplified by using adaptors which are complementary to all possible 4-nt cohesive ends
generated by class-IIS restriction enzymes [D. R. Smith, Ligation-mediated PCR of restriction fragments from large DNA
molecules, PCR Methods Appl. 2:21-27 (1992); Unrau, P. and Deugau, K.V., Gene, 145, 163-169 (1994)]. However,
though all of these methods employ class-IIS restriction enzymes and use the 4-nt overhangs generated by them as
means for structural analysis of genomes, unlike the present invention, they do not aim at recording the expression of
genes in a specific tissue or cell.
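Cutting "at a precise distance outside the recognition site" can be illustrated with a short sketch. The text does not state the recognition sequence or cut offsets of any enzyme; the values below (Fok I recognizing GGATG and cutting 9 nt downstream on the top strand and 13 nt downstream on the bottom strand) are assumptions taken from commonly cited enzyme specifications. The sketch scans only the given strand and ignores sites in the reverse orientation.

```python
FOKI_SITE = "GGATG"   # assumed recognition sequence (not stated in the text)
TOP_CUT = 9           # assumed: top strand is cut 9 nt downstream of the site
BOTTOM_CUT = 13       # assumed: bottom strand is cut 13 nt downstream of the site

def foki_overhangs(seq):
    """Return (site_start, overhang) for each forward-orientation Fok I site.

    The overhang is the 4-nt stretch between the two staggered cuts, written
    in top-strand 5'->3' orientation. Its sequence is unrelated to the
    recognition site, which is why the cohesive ends differ between fragments.
    """
    seq = seq.upper()
    hits = []
    start = seq.find(FOKI_SITE)
    while start != -1:
        site_end = start + len(FOKI_SITE)
        lo, hi = site_end + TOP_CUT, site_end + BOTTOM_CUT
        if hi <= len(seq):          # skip sites too close to the end to be cut
            hits.append((start, seq[lo:hi]))
        start = seq.find(FOKI_SITE, start + 1)
    return hits

# Hypothetical test sequence: site, 9 spacer bases, then the 4-nt overhang TTGC
example = "AA" + "GGATG" + "ACGTACGTA" + "TTGC" + "CCC"
print(foki_overhangs(example))
```

Because the four overhang bases are arbitrary, there are 4^4 = 256 possible cohesive ends, which is what makes sorting fragments by overhang sequence possible.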

In the Human Genome Project, an approach is being vigorously discussed in which a tissue-derived cDNA fragment is taken as
a sample and its partial sequence as well as its location in a chromosome is determined. In conventional methods,
cDNA fragments are randomly taken from a cDNA library. Accordingly, it is impossible to avoid repeated sampling of the
same fragment, and highly expressed fragments tend to be selected preferentially.

It is an object of the present invention to provide a method which can analyze the state of expression of genes or 
deletion due to some abnormalities in a tissue or a cell in a short period and yet easily for a large quantity of genes. 

It is a further object of the present invention to provide a method which is applicable to a rapid isolation of the coding 
region of a protein as well as an amplification of restriction fragments of cloned DNA or genomic DNA. 


SUMMARY OF THE INVENTION 

The present inventor has made extensive and intensive research toward solving the above problems
and, as a result, found that, by using class-IIS restriction enzymes or a combination of a class-IIS restriction enzyme
and a class-II restriction enzyme, it is possible to classify (index) cDNA or DNA into groups in a short period and without
duplication. Thus, the present invention has been achieved.

The present invention relates to a method for molecular indexing comprising the following steps (hereinafter 
referred to as "Method I"): 
(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction
enzyme of class-IIS,

(2) ligating each of the resultant cDNA fragments to one from a pool of 64 biotinylated adaptors cohesive to all
possible overhangs,

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzyme of class-IIS which
are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA sample,

(4) obtaining a second cDNA sample by repeating the above steps (1) to (3) wherein the second class-IIS
restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are used for the
subsequent digestion,

(5) obtaining a third cDNA sample by repeating the above steps (1) to (3) wherein the third class-IIS restriction
enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for the
subsequent digestion,

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and then
removing from the samples the oligonucleotide complementary to an adaptor-primer to be used in (7),

(7) amplifying each of the resultant cDNA samples by PCR using an adaptor-primer and one of anchored oligo-dT
primers,

(8) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes of
the fragments obtained.

The present invention also relates to a method for molecular indexing comprising the following steps (hereinafter
referred to as "Method II"):

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA, or DNA, with a restriction
enzyme of class-II,

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the class-II
restriction enzyme,

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class-IIS,

(4) ligating each of the resultant cDNA or DNA fragments to one from a pool of 64 biotinylated adaptors cohesive
to all possible overhangs,

(5) recovering the resultant ligated sample by using streptavidin-coated paramagnetic beads and then removing
from said sample the oligonucleotides complementary to adaptor-primers to be used in (6),

(6) amplifying the resultant cDNA or DNA sample by PCR using adaptor-primers,

(7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes of
the fragments obtained.


BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a schematic illustration for the principle of Method I. 

Fig. 2 shows structures of cDNA which has been synthesized by reverse-transcribing RNA using a mixture of 3
oligonucleotides as primers.

Fig. 3 shows a schematic illustration for the principle of Method II. 

Fig. 4 shows an example of the polyacrylamide electrophoresis pattern of mouse liver RNA obtained by Method I. 
Fig. 5 shows an example of the polyacrylamide electrophoresis pattern of an amplified product from mouse liver
RNA obtained by Method II.


EFFECT OF THE INVENTION 

According to Method I, it is possible to examine the state of expression of those genes having cleavage sites of
class-IIS restriction enzymes (97% of total genes are estimated to have such sites when Fok I, Bsm AI and Bsm FI are
used) in a tissue with one to two weeks of experiments per human subject, since a small number of DNA sub-groups
suffices for this analysis. Furthermore, according to Method I, since in principle only one fragment is amplified from
each gene, genes can be classified (indexed) into sub-groups without redundancy. Therefore, by comparing
the analyzed pattern between normal and abnormal tissues by using Method I, it is possible to diagnose variations such
as tumors easily, correctly and promptly. In addition, Method I is also applicable to the search and isolation of genes of
physiologically active substances that are potential pharmaceuticals or the causative genes of hereditary diseases, as
well as the isolation of those genes that are useful for improving agricultural products.

On the other hand, according to Method II, the target of analysis is not limited to RNA (or cDNA reverse-transcribed
therefrom), since oligo-dT primers for poly A are not used as primers. According to Method II, it is also possible to
amplify restriction fragments of cosmid DNA or genomic DNA. Therefore, Method II is applicable to the mapping of
these DNAs.

In addition, the regions amplified by PCR are not restricted to non-coding regions and thus it is not necessary to obtain
clones of upstream regions in order to know the primary structure of a protein.


DETAILED DESCRIPTION OF THE INVENTION 

[I] Hereinbelow, the steps, action and effects of Method I will be described with reference to Fig. 1.

(1) First, the total RNA of a cell or a tissue is converted to cDNA with a reverse transcriptase and the resultant
cDNA is digested with a first class-IIS restriction enzyme.

(2) One from a pool of 64 biotinylated adaptors described below is ligated to the resultant cDNA fragments with E.
coli DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a mixture of A, C, G and
T and the inner three bases are one of all possible sequences. (These adaptors must not be phosphorylated at their
5' ends which form protruding cohesive ends.) At this point, the restriction fragments are classified into 64
sub-groups.

(3) Subsequently, the cDNA fragments are further digested with a second and a third class-IIS restriction enzyme
which are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA
sample.

A second cDNA sample is obtained by repeating the above steps (1) to (3) wherein the second class-IIS
restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are used
for the subsequent digestion, and also a third cDNA sample is obtained by repeating the above steps (1) to (3)
wherein the third class-IIS restriction enzyme is used for the initial digestion and the first and the second class-IIS
restriction enzymes are used for the subsequent digestion.

(4) As a result of digestion with the 3 class-IIS restriction enzymes described above, there are produced fragments
which have lost poly A [see Fig. 1, (i)] and fragments which still have poly A [see Fig. 1, (ii)]. Of these fragments,
the former ones which have lost poly A will no longer be amplified in the subsequent amplification step and only the
latter ones with poly A will be amplified. Accordingly, the latter fragments are further classified into 64 x 3 = 192
sub-groups at this point depending on the cleavage site nearest the poly A side (i.e., depending on which of the
three restriction enzymes used has its cleavage site there).

(5) Subsequently, the ligation sample is recovered with streptavidin-coated paramagnetic beads and the cDNA
fragments are treated with a dilute alkaline solution. By these operations, the oligonucleotide complementary to an
adaptor-primer to be used in (6) is removed (the oligonucleotide would otherwise inhibit the PCR reaction).

(6) The resultant cDNA sample is amplified by PCR by using a combination of an adaptor-primer and one of
d(T)25A, d(T)25C and d(T)25G, which are anchored oligo-dT primers. Which of the three oligo-dT primers amplifies a
given fragment is determined by the base (T, C or G) adjacent to the poly(A) tail. At this point, the cDNA
fragments are further classified into 192 x 3 = 576 groups.

(7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the
fragments obtained are automatically recorded by a sequencer.
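Step (4) says that a cDNA is assigned to one of the three enzyme classes by whichever cleavage site lies nearest the poly A tail. A minimal sketch of that rule follows; the site coordinates are hypothetical and are assumed to be counted from the 5' end of the cDNA, so the largest coordinate is the one closest to poly A.

```python
def indexing_enzyme(cut_sites):
    """Return (enzyme, position) for the cleavage site nearest the poly A tail.

    cut_sites maps an enzyme name to a list of cut positions counted from the
    5' end of the cDNA. Returns None when no enzyme cuts the cDNA, in which
    case the gene is not displayed by the method.
    """
    best = None
    for enzyme, positions in cut_sites.items():
        for pos in positions:
            if best is None or pos > best[1]:
                best = (enzyme, pos)
    return best

# Hypothetical cDNA: two Fok I sites, one Bsm AI site, no Bsm FI site
sites = {"Fok I": [120, 860], "Bsm AI": [430], "Bsm FI": []}
print(indexing_enzyme(sites))
```

In this example the poly A-bearing fragment starts at the Fok I site at position 860, so it survives the second digestion only in the sample whose initial digestion used Fok I. This is why each gene yields one amplified fragment across the three samples.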


The above-described procedures are repeated with 64 adaptors, 3 class-IIS restriction enzymes and 3 anchored
oligo-dT primers. Therefore, an RNA population is classified into 576 groups. With respect to class-IIS restriction
enzymes, it is estimated that 97% of genes have at least one cleavage site of Fok I, Bsm AI or Bsm FI. Accordingly, by
using these 3 restriction enzymes in the method of the invention, it is theoretically possible to recover and present
without redundancy almost all of one total RNA population.

In addition, the above method (Method I) of the invention may be similarly carried out in a modified method which
is different from the above only in the following points. In step (2) above, one from a pool of 256 biotinylated adaptors is
used. Each adaptor of the pool has a four-nucleotide 5' end overhang wherein the sequence is one of all possible
sequences. The second digestion with class-IIS restriction enzymes described in (3) above is not carried out.

In this modified method, an RNA population is classified into 768 groups since 256 adaptors and 3 anchored
oligo-dT primers are used.
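The group counts quoted for Method I (576) and for the modified method (768) can be checked by enumerating the combinations. This is a sketch only; the tuple layout used as a group label is an illustrative choice, not something defined in the text.

```python
from itertools import product

BASES = "ACGT"
ENZYMES = ("Fok I", "Bsm AI", "Bsm FI")   # the three class-IIS enzymes named in the text
ANCHORS = ("A", "C", "G")                 # 3' bases of the anchored oligo-dT primers

# Method I: inner three overhang bases (64) x enzyme (3) x anchor base (3)
inner_triplets = ["".join(p) for p in product(BASES, repeat=3)]
method_i_groups = list(product(ENZYMES, inner_triplets, ANCHORS))

# Modified method: full 4-nt overhang (256) x anchor base (3), no second digestion
all_overhangs = ["".join(p) for p in product(BASES, repeat=4)]
modified_groups = list(product(all_overhangs, ANCHORS))

print(len(method_i_groups), len(modified_groups))
```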

Further, in Method I, a mixture of the following oligonucleotides may be used as primers when converting the total
RNA from a cell or tissue into cDNA with a reverse transcriptase:

5' OH-GGATCCT16A-3'
5' OH-CAGCTGT16C-3'
5' OH-CTCGAGT16G-3'

When such primers are used, there can be obtained cDNA molecules which have T, G or C adjacent to poly (A) on
the 5' side and a 6-base sequence added to the outside (3' side) of poly (A) (see Fig. 2).
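The shorthand T16 in the primer names denotes a run of sixteen T residues. A minimal sketch that expands the three primers to full sequences is given below; the dictionary and function names are illustrative.

```python
# 6-base 5' tag paired with each 3' anchor base, as listed in the text
RT_TAGS = {"A": "GGATCC", "C": "CAGCTG", "G": "CTCGAG"}

def rt_primer(anchor, t_len=16):
    """Expand tag + T(t_len) + anchor into a full 5'->3' primer sequence."""
    return RT_TAGS[anchor] + "T" * t_len + anchor

for base in "ACG":
    print(base, rt_primer(base), len(rt_primer(base)))
```

Each primer is 23 nt long: a 6-base tag, sixteen T residues and one anchor base. The tag differs for each anchor base, so the anchor identity is encoded twice in the resulting cDNA.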
In this case, amplification is carried out by using any one of 5' OH-GGATCCT16A-3' [instead of the above anchored
oligo-dT primer d(T)25A], 5' OH-CAGCTGT16C-3' [instead of the above d(T)25C] and 5' OH-CTCGAGT16G-3' [instead
of the above d(T)25G]. According to these procedures, analysis can be more accurate because, in addition to the
specificity for cDNA conferred by the single base at the 3' end of the primers, the specificity conferred by the 6-base
sequence at the 5' end of the primers is utilized.

The target RNA for Method I of the invention is isolated and purified from, for example, body tissues such as hemat- 
opoietic tissues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional 
methods such as the guanidine thiocyanate method and the phenol-chloroform extraction method and then incubated 
with a reverse transcriptase and deoxyribonucleotide triphosphates for reverse-transcription into cDNA. 

With respect to the class-IIS restriction enzymes used in Method I of the invention, there is no particular limitation
as long as the restriction enzyme forms a 5'-protruding cohesive end consisting of 4 bases. Specific examples include
commercially available Fok I (Takara Shuzo) and Bsm AI and Bsm FI (both manufactured by NEB). These three
restriction enzymes may be used in combination for the initial digestion (with one enzyme) and the subsequent digestion (with
two enzymes). In the modified method, one of these three enzymes may be used.

In Method I of the invention, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27
nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A, C, G and T
and the inner three bases are one of all possible sequences, and ii) an oligonucleotide which is complementary to the
oligonucleotide i), shorter by 4 bases and biotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors.
In the modified method, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27
nucleotides which forms a 4-nt 5' protruding cohesive end wherein the sequence is one of all possible sequences, and
ii) an oligonucleotide which is complementary to the oligonucleotide i), shorter by 4 bases and biotinylated at the 5' end.
Thus, there are 256 kinds of the biotinylated adaptors.
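How a fragment's cohesive end selects one of the 64 adaptors can be sketched as follows. The sketch assumes standard antiparallel Watson-Crick pairing between the two 5' overhangs, so that the adaptor's degenerate outermost base N pairs with the innermost base of the fragment overhang. The function names are illustrative and not taken from the text.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def adaptor_overhangs():
    """The 64 adaptor overhangs: degenerate N followed by each inner triplet."""
    return ["N" + "".join(p) for p in product("ACGT", repeat=3)]

def adaptor_for(fragment_overhang):
    """Adaptor overhang (5'->3') that anneals to a 4-nt 5' fragment overhang.

    Assumption: the first base of the reverse complement is covered by the
    degenerate position N, so only the remaining three bases select the adaptor.
    """
    return "N" + reverse_complement(fragment_overhang)[1:]

pool = adaptor_overhangs()
print(len(pool), adaptor_for("TTGC"))
```

Under this assumption four different fragment overhangs (here TTGA, TTGC, TTGG and TTGT) share one adaptor, which is why a pool of 64 adaptors covers all 256 possible 4-nt ends.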

In order to allow E. coli DNA ligase to recognize the 3 bases of a cDNA fragment adjacent to the binding site, phos- 
phorylation of the 5' ends of the above adaptors which form cohesive ends is not carried out. 

In Method I of the invention, one of the two primers used for PCR is an oligonucleotide having a common sequence
with the oligonucleotide constituting the adaptor described above which is ligated to cDNA at its 3' end (=
adaptor-primer). As a marker which labels this adaptor-primer, those which are used in conventional analysis may be
used. Specific examples include fluorescent dyes, radioactive materials and enzymes.

In Method I of the invention, the other primer used for PCR is one of three oligo-dT primers whose 3' end base is
A, C or G. These primers may be synthesized by a commercial nucleic acid synthesizer.

[II] Hereinbelow, the steps, action and effects of Method II will be described with reference to Fig. 3. 

(1) First, DNA or cDNA of a cell or tissue is digested with a class-II restriction enzyme (EcoRI is used in Fig. 3).

(2) An adaptor which is cohesive to ends generated by the class-II enzyme is ligated to each of the DNA or cDNA
fragments with T4 DNA ligase (the adaptor must be phosphorylated at the 5' end which forms the cohesive end).

(3) The resultant DNA or cDNA sample is further digested with a class-IIS restriction enzyme (Bsm AI is used in
Fig. 3).

(4) One from a pool of 64 biotinylated adaptors described below is ligated to each of the resultant cDNA or DNA
fragments with E. coli DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a
mixture of A, C, G and T, and the inner three bases are one of all possible sequences. (These adaptors must not be
phosphorylated at their 5' ends which form cohesive ends.) At this point, the restriction fragments are classified into
64 groups.

(5) Subsequently, the ligation sample is recovered with streptavidin-coated paramagnetic beads and the DNA or
cDNA fragments are treated with a dilute alkaline solution. By these operations, those oligonucleotides
complementary to adaptor-primers, which would otherwise inhibit the PCR reaction, are removed.

(6) Amplification by PCR is carried out using two adaptor-primers. The one derived from the adaptor for ends
generated by the class-II enzyme is referred to as "adaptor-primer 1" and the other derived from the biotinylated
adaptors is referred to as "adaptor-primer 2". Details will be described afterwards.

(7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the
fragments obtained are automatically recorded by a sequencer.

By using a class-II restriction enzyme, a class-IIS restriction enzyme and 64 biotinylated adaptors in the operations
described above, the DNA or cDNA fragments generated by the class-II and class-IIS restriction enzymes used can be
separated and displayed.

When cDNA which has been reverse-transcribed from RNA is used as a target of analysis of Method II, a cDNA
sample is prepared as follows. RNA is isolated and purified from, for example, body tissues such as hematopoietic
tissues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional methods such
as the guanidine thiocyanate method and the phenol-chloroform extraction method and then incubated with a reverse
transcriptase and deoxyribonucleotide triphosphates for reverse-transcription into cDNA.

It is also possible to use DNA as a target of analysis of Method II. In this case, a DNA sample is prepared as follows.
A sample from, for example, body tissues such as hematopoietic tissues including bone marrow, peripheral blood,
lymphocytes, etc. or a cell suspension in a body fluid is disrupted with a Polytron homogenizer or the like and incubated
with proteinase K to thereby degrade proteins. Then, the reaction solution is subjected to phenol extraction and 2 volumes
of ethanol are added to the aqueous layer for precipitation. The precipitate is treated with ribonuclease (RNase) not
containing deoxyribonuclease (DNase) to thereby remove RNA.

With respect to the class-II restriction enzyme used in Method II of the invention, there is no particular limitation as
long as the enzyme recognizes a specific base sequence, cuts the site specifically and generates cohesive ends. Specific
examples include EcoRI, BamHI, HindIII, BclI, BglII, SalI, XhoI, AccI, AvaI, Sau3A, TaqI, NotI (which form 5'-protruding
cohesive ends), and PstI, SacI, KpnI, HaeII (which form 3'-protruding ends).

In particular, for the analysis of genomic DNA, restriction enzymes which recognize an 8-base sequence (e.g., NotI)
are preferably used.

With respect to the class-IIS restriction enzyme used in Method II of the invention, there is no particular limitation
as long as the enzyme generates 4-base 5'-protruding cohesive ends. Specific examples include commercially
available Fok I (Takara Shuzo) and Bsm AI, Bsm FI, SfaNI and BbvI (all manufactured by NEB).

It is also possible to use 2 or 3 class-IIS restriction enzymes in combination to increase the number of groups as
described in Method I.

In Method II of the invention, the adaptor consists of i) an oligonucleotide of 20-30 nucleotides forming a 5'- (or
3'-) overhang which is cohesive to ends of restriction fragments, and ii) an oligonucleotide which is complementary to the
above oligonucleotide i) and shorter by the number of bases forming the overhang.

The adaptor must be phosphorylated at its 5' end (which forms a cohesive end) so that an adaptor oligonucleotide
is bound to the DNA strand which is recovered with streptavidin-coated beads.

In Method II of the invention, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of
24-27 nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A, C, G and T
and the inner three bases are one of all possible sequences, and ii) an oligonucleotide which is complementary to the
oligonucleotide i), shorter by 4 bases and biotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors.
In order to allow E. coli DNA ligase to recognize the 3 bases of a cDNA fragment adjacent to the binding site,
phosphorylation of the 5' end of the above biotinylated adaptor which forms a cohesive end is not carried out.

In Method II, one of the primers used in PCR is an oligonucleotide having a common sequence with the
oligonucleotide constituting the adaptor described above which is ligated to cDNA or DNA fragments at its 3' end
(adaptor-primer 1).

In Method II of the invention, the other primer used for PCR is an oligonucleotide having a common sequence with
the oligonucleotide constituting the biotinylated adaptor described above which is ligated to cDNA or DNA
fragments at its 3' end (adaptor-primer 2). As a marker which labels this adaptor-primer, those which are used in
conventional analysis may be used. Specific examples include fluorescent dyes, radioactive materials and enzymes.

These primers may be synthesized by using a commercial nucleic acid synthesizer. 

Potential target diseases which may be analyzed or diagnosed by Method I or Method II of the invention include
malignant tumors such as brain tumor, stomach cancer, large intestine cancer, breast cancer, uterus cancer, skin
cancer, prostate cancer and malignant melanoma; virus infections such as herpes group infections, chronic hepatitis,
cytomegalovirus infection and acquired immunodeficiency syndrome; and multifactorial hereditary diseases such as
diabetes and hypertension.

PREFERRED EMBODIMENTS OF THE INVENTION

The present invention will be described in more detail below with reference to the following Reference Example and 
Examples, which are provided for the purpose of explanation and should not be construed as limiting the scope of the 
invention. 


[Reference Example] Preparation of cDNA 

(1) Purification of RNA by ultracentrifugation 

Mouse livers lyophilized in dry ice or liquid nitrogen were crushed with a homogenizer. To the crushed material, 5
volumes of a GuCNS solution was added at room temperature and agitated with a vortex mixer. To a 10 ml polyallomer
tube, 3.5 ml of 5.7 M CsCl/0.1 M EDTA solution was added and 6 ml of the resultant sample was layered over it and then
centrifuged overnight at 15°C at 32000 rpm using a Beckman L70 centrifuge.






EP 0 735 144 A1 



(2) Recovery of the RNA after ultracentrifugation 

The tube was removed from the rotor and all of the supernatant was discarded. The tube wall was wiped and dried. 
Thereafter, the precipitate was dissolved in 300 µl of TE buffer.


(3) Ethanol precipitation 

To the aqueous layer, 1/10 volume of 3 M potassium acetate (pH 5.0) was added, mixed gently and placed in ice.
Then, 2.5 volumes of ice-cooled ethanol was added to the above mixture and mixed gently. The resultant mixture was
left at -80°C for several hours and centrifuged at 4°C for 5 minutes to precipitate RNA. The ethanol was discarded. The
RNA precipitate was washed with ice-cooled 70% ethanol and re-centrifuged to precipitate RNA. After the ethanol was
discarded, the RNA precipitate was dried.

The above precipitate was dissolved in about 100 µl of sterile distilled water per 1 g of the tissue cells to obtain an
RNA solution (RNA concentration = approx. 5 µg/µl).


(4) Preparation of cDNA template 

(4-1) Preparation of single-stranded cDNA molecules 

First, only the resultant RNA and the oligo-dT primers were heated at 70°C for 2-3 minutes. Then, the other reagents were
added thereto and the mixture was kept at 37°C for 1 hour to synthesize cDNA molecules.



* Composition of the reaction solution

5x Reverse transcriptase buffer (Gibco-BRL)                4 µl
2 mM dNTP (Pharmacia)                                      4 µl
0.1 M DTT                                                  2 µl
10 pmol/µl 5'-amino (dT)18                                 1 µl
Total RNA (3 µg) and distilled water                       7.5 µl
RNase inhibitor*1) (40 u/µl) (Toyobo)                      0.5 µl
200 u/µl M-MLV Reverse transcriptase*2) (Gibco-BRL)

*1) derived from human placentas
*2) Moloney Murine Leukemia Virus




(4-2) Synthesis of double-stranded cDNA molecules 

The reaction solution described below was added to the single-stranded cDNA reaction solution and kept at 16°C
for 2 hours to thereby prepare double-stranded cDNA molecules. After the completion of the reaction, 3 µl of 0.25 M
EDTA (pH 7.5) and 2 µl of 5 M NaCl were added thereto. Then, phenol extraction and ethanol precipitation were
conducted and the precipitate was dissolved in 240 µl of distilled water.



* Composition of the reaction solution

10 mM MgCl2                                     70 µl
1 M Tris-Cl (pH 7.5)                            10 µl
1 M (NH4)2SO4                                   1.5 µl
RNase H (Toyobo) (1 u/µl)                       1.5 µl
E. coli DNA polymerase I (Toyobo) (10 u/µl)     4.5 µl
[Example 1] Analysis by the DNA molecular indexing method
(1) Digestion with a class-IIS restriction enzyme (initial digestion) 

The cDNA prepared in the Reference Example described above was digested with a restriction enzyme by keeping the
cDNA in any one of the following reaction solutions (A) to (C) at a specified temperature under specified conditions.



* Composition of the reaction solution (A) (using Fok I)

10x M buffer                        10 µl
0.1% BSA (Takara Shuzo)             10 µl
cDNA sample                         80 µl
Fok I (Takara Shuzo) (10 u/µl)      0.5 µl

Kept at 37°C for 50 minutes to 1 hour.






* Composition of the reaction solution (B) (using Bsm AI)

10x buffer for Bsm AI (NEB)         10 µl
0.1% BSA                            10 µl
cDNA sample                         80 µl
Bsm AI (NEB) (5 u/µl)               1 µl

Kept at 55°C for 50 minutes to 1 hour.



* Composition of the reaction solution (C) (using Bsm FI)

10x H buffer                        10 µl
Distilled water                     10 µl
cDNA sample                         80 µl
Bsm FI (NEB) (5 u/µl)               1 µl

Kept at 65°C for 50 minutes to 1 hour.




After the completion of each of the reactions (i), (ii) and (iii) above, 3 µl of 0.25 M EDTA (pH 7.5) and 2 µl of 5 M
NaCl were added to each reaction solution. Then, phenol extraction and ethanol precipitation were conducted and each
precipitate was dissolved in 70 µl of distilled water.

(2) Addition of adaptors

To the cDNA fragments obtained in (1) above, one of the following adaptors having the sequences described below:
C1T adaptors:

5'-B-GTACATATTGTCGTTAGAACGCT-3'
5'-NXYZAGCGTTCTAACGACAATATGTAC-3'
or

C1G adaptors:

5'-B-GTACATATTGTCGTTAGAACGCG-3'
5'-NXYZCGCGTTCTAACGACAATATGTAC-3'
(wherein B represents biotin; N represents any of the four bases; and XYZ represents one of the 64 possible
sequences. When YZ = AA, AT, TA or TT, C1G adaptors were used. Otherwise, C1T adaptors were used.)
were added and kept in the following reaction solution at 16°C overnight, to thereby ligate the cDNA fragments to the
adaptors.




* Composition of the reaction solution

10x E. coli DNA ligase buffer                                  1 µl
100 mM (NH4)2SO4                                               1 µl
1 pmol/µl adaptor solution                                     1 µl
cDNA sample digested with a class-IIS restriction enzyme       [illegible] µl
E. coli DNA ligase                                             3 units
Distilled water to make 10 µl



(When the sequence XYZ contained neither G nor C, 5 pmol/µl adaptor solution and 30 units of E. coli DNA ligase 
were used.) 
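
The adaptor pool and the ligation rule described in this step can be enumerated programmatically. The following is a minimal sketch, assuming the overhang strand is simply N + XYZ followed by the reverse complement of the biotinylated strand (which is consistent with the sequences listed above); the helper names `adaptor_for` and `ligation_conditions` are hypothetical and not part of the described method.

```python
from itertools import product

C1T = "GTACATATTGTCGTTAGAACGCT"  # biotinylated strand of the C1T adaptors
C1G = "GTACATATTGTCGTTAGAACGCG"  # biotinylated strand of the C1G adaptors
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def adaptor_for(xyz):
    """Return (name, biotinylated strand, overhang strand) for one XYZ."""
    # C1G is used when YZ is AA, AT, TA or TT; otherwise C1T is used.
    if xyz[1:] in {"AA", "AT", "TA", "TT"}:
        name, top = "C1G", C1G
    else:
        name, top = "C1T", C1T
    return name, top, "N" + xyz + reverse_complement(top)

def ligation_conditions(xyz):
    """Return (adaptor pmol/µl, ligase units) as stated for Example 1."""
    if set(xyz) <= {"A", "T"}:   # XYZ contains neither G nor C
        return 5, 30
    return 1, 3

pool = {"".join(p): adaptor_for("".join(p)) for p in product("ACGT", repeat=3)}
print(len(pool), pool["CCC"][2], ligation_conditions("ATA"))
```

Under these assumptions the pool holds 64 adaptors, 16 of which use the C1G strand, and 8 of which fall under the higher adaptor and ligase amounts.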

(3) Digestion with class-IIS restriction enzymes (the second digestion) 

The cDNA fragments obtained in (2) above were further digested with class-IIS restriction enzymes by keeping the 
cDNA sample at a specified temperature under the conditions specified below: 

(i) When a Fok I digest was used: 

40 µl of distilled water and 5 µl of 10x H buffer were added. 
Bsm FI (1 unit) was added and kept at 65°C for 50 minutes. 
Bsm AI (1 unit) was added and kept at 55°C for 50 minutes. 

(ii) When a Bsm AI digest was used: 

40 µl of distilled water and 5 µl of 10x T buffer were added. 
Fok I (1 unit) was added and kept at 37°C for 50 minutes. 
Bsm FI (1 unit) was added and kept at 65°C for 50 minutes. 

(iii) When a Bsm FI digest was used: 

40 µl of distilled water and 5 µl of 10x M buffer were added. 
Fok I (1 unit) was added and kept at 37°C for 50 minutes. 
Bsm AI (1 unit) and 1 µl of 4 M NaCl were added and kept at 55°C for 50 minutes. 
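
The rotation of the three enzymes between the first and second digestions can be summarised as a small lookup. This is only a sketch of the conditions listed in (i) to (iii) above; the dictionary layout and the function name `second_digestion_plan` are illustrative assumptions, and the extra 1 µl of 4 M NaCl used in case (iii) is not modelled.

```python
# Incubation temperatures (°C) used for each enzyme in Example 1.
TEMPERATURE_C = {"Fok I": 37, "Bsm AI": 55, "Bsm FI": 65}

# Buffer and enzyme order for the second digestion, keyed by the first enzyme.
SECOND_DIGESTION = {
    "Fok I":  {"buffer": "10x H", "order": ["Bsm FI", "Bsm AI"]},
    "Bsm AI": {"buffer": "10x T", "order": ["Fok I", "Bsm FI"]},
    "Bsm FI": {"buffer": "10x M", "order": ["Fok I", "Bsm AI"]},
}

def second_digestion_plan(first_enzyme):
    """List (enzyme, temperature in °C, minutes) for the second digestion."""
    plan = SECOND_DIGESTION[first_enzyme]
    return [(enzyme, TEMPERATURE_C[enzyme], 50) for enzyme in plan["order"]]

for first in SECOND_DIGESTION:
    print(first, "->", second_digestion_plan(first))
```

In each case the second digestion uses exactly the two enzymes that were not used for the first digestion.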

(4) Amplification by PCR 

(4-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use, streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
1x B&W buffer (10 mM Tris-Cl pH 7.5, 1 M NaCl, 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To each sample, 15 µl of 5 M NaCl and 5 µl of the paramagnetic beads were added, left stationary for 15 minutes 
and washed with 1x B&W buffer once. Then, 10 µl of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 µl of 0.1 M NaOH once, with 1x B&W buffer once and with distilled 
water twice. 

(4-2) PCR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
96°C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94°C, 1 
minute at 50°C and 1 minute at 72°C was repeated 25 to 35 times. After an extension step was carried out at 72°C for 
20 minutes, the reaction solution was cooled to room temperature. 
* Compositions of the reaction solutions (per one sample) 

(i) Enzyme reaction solution 

    10x PCR buffer for Stoffel fragment          1 µl
    2 mM dNTP                                    1 µl
    25 mM MgCl2                                  1.2 µl
    Distilled water                              4.3 µl
    10 u/µl Stoffel fragment *1)                 0.05 µl

(ii) Primer reaction solution 

    10 pmol/µl fluorescent-C1T                   0.5 µl
    10 pmol/µl d(T)25A [or d(T)25C, d(T)25G]     2 µl

*1) A portion of AmpliTaq DNA polymerase fragment (Perkin Elmer) 



The primers are used in the combinations of JOE-C1T and d(T)25A; FAM-C1T and d(T)25C; and TAMRA-C1T and 
d(T)25G. [JOE: 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein; FAM: 5'-carboxyfluorescein; TAMRA: 6-carboxytetramethyl 
rhodamine (all manufactured by Perkin Elmer); the sequence of C1T: d(GTACATATTGTCGTTAGAACGCT)]. 

Alternatively, when C1G adaptors were used, the composition of the primer reaction solution is: 

    10 pmol/µl fluorescent-C1G                   0.5 µl
    10 pmol/µl d(T)25A [or d(T)25C, d(T)25G]     2 µl



The primers are used in the combinations of JOE-C1G and d(T)25A; FAM-C1G and d(T)25C; and TAMRA-C1G and 
d(T)25G. [JOE: 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein; FAM: 5'-carboxyfluorescein; TAMRA: 6-carboxytetramethyl 
rhodamine (all manufactured by Perkin Elmer); the sequence of C1G: d(GTACATATTGTCGTTAGAACGCG)]. 
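
For planning purposes, the programmed duration of the thermal profile given in (4-2) can be worked out directly from the stated step times. The sketch below counts only the hold times; ramping between temperatures and the final cooling are not included, and the function name is illustrative.

```python
def program_minutes(cycles):
    """Total hold time in minutes for the PCR program described in (4-2)."""
    initial_denaturation = 1.0          # 96°C for 1 minute
    per_cycle = 0.5 + 1.0 + 1.0         # 94°C 30 s, 50°C 1 min, 72°C 1 min
    final_extension = 20.0              # 72°C for 20 minutes
    return initial_denaturation + cycles * per_cycle + final_extension

shortest = program_minutes(25)
longest = program_minutes(35)
print(f"{shortest} to {longest} minutes of hold time")
```

For 25 to 35 cycles this gives 83.5 to 108.5 minutes of hold time.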

(4-3) Preparation of Electrophoresis Samples 

From each of the reaction products, a sample was taken as follows: 1 µl from the combination of FAM-C1 and 
d(T)25C, 3 µl from the combination of JOE-C1 and d(T)25A and 3 µl from the combination of TAMRA-C1 and d(T)25G. 
To each sample, 5 µl of T4 DPase solution having the following composition was added and reacted at 37°C for 40 minutes. 



* Composition of T4 DPase solution (per one sample) 

    10x M buffer                     0.5 µl
    2 mM dNTP                        0.5 µl
    Distilled water                  4 µl
    T4 DNA polymerase (Toyobo)       1 unit



After ethanol precipitation of the reaction solution, 3.5 µl of a buffer (80% formaldehyde, 10 mM EDTA, 6 mg/ml blue 
dextran) was added to the sample (i.e., precipitate), heated at 95°C for 4 minutes, then immediately applied to the sample 
well of ABI 373A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of 30W for 13 
hours). 

Fig. 4 shows one example of the electrophoresis patterns obtained. 

[Example 2] Analysis by the DNA molecular indexing method 

(1) Digestion with a class-II restriction enzyme 

The cDNA prepared in Reference Example described above was digested with a restriction enzyme by keeping the 
cDNA in the following reaction solution at a specified temperature under specified conditions. 



* Composition of the reaction solution (using EcoRI) 

    10x high salt buffer (attached to the enzyme)     5 µl
    cDNA sample                                       45 µl
    EcoRI (Toyobo or Takara Shuzo)                    5 units

    Kept at 37°C for 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total 
precipitate was used for the subsequent reaction. 

(2) Addition of adaptors 

To the cDNA fragments obtained in (1) above, the following adaptors 
5'-P-AATTCTTAACCAGGCTGAACTTGCTC-3' 
5'-OH-GAGCAAGTTCAGCCTGGTTAAG-3' 

were ligated by keeping the cDNA sample in the following reaction solution at 16°C for 16 hours or more. 



* Composition of the reaction solution 

    10x ligation buffer (similar to Toyobo's)     2 µl
    2.5 pmol/µl EcoRI adaptors                    2 µl
    T4 DNA ligase                                 150 units
    Total volume                                  20 µl



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total 
precipitate was used for the subsequent reaction. 
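
The two strands of the EcoRI adaptor used in (2) can be checked for complementarity, which also shows the single-stranded end left for ligation. This is a minimal sketch; the variable names are illustrative, and the sequences are those given for the adaptor above with spacing removed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

upper = "AATTCTTAACCAGGCTGAACTTGCTC"  # 5'-phosphorylated strand
lower = "GAGCAAGTTCAGCCTGGTTAAG"      # 5'-OH strand

# The lower strand should pair with the 3' portion of the upper strand.
duplex_ok = upper.endswith(reverse_complement(lower))
# Whatever remains unpaired at the 5' end of the upper strand is the overhang.
overhang = upper[: len(upper) - len(lower)]
print(duplex_ok, overhang)
```

The unpaired 5' end is AATT, the overhang produced by EcoRI cleavage, so the adaptor is cohesive to the EcoRI-digested fragments.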

(3) Digestion with a class-IIS restriction enzyme 

The cDNA treated in (2) above was further digested with a restriction enzyme by keeping the cDNA sample in the 
following reaction solution at a specified temperature under specified conditions. 



* Composition of the reaction solution (using Bsm AI) 

    10x buffer for Bsm AI (NEB)        10 µl
    0.1% BSA                           10 µl
    cDNA sample                        80 µl
    Bsm AI (NEB) (5 u/µl)              0.5 µl

    Kept at 65°C for 50 minutes to 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the precipitate 
was dissolved in 30 µl of purified water. 

(4) Addition of biotinylated adaptors 

To the cDNA fragments obtained in (3) above, the following adaptors: 

C1T adaptors: 5'-B-GTACATATTGTCGTTAGAACGCT-3' 
              5'-OH-NXYZAGCGTTCTAACGACAATATGTAC-3' 
C1G adaptors: 5'-B-GTACATATTGTCGTTAGAACGCG-3' 
              5'-OH-NXYZCGCGTTCTAACGACAATATGTAC-3' 

(wherein B represents biotin; N represents any of the four bases; and XYZ represents one of the 64 possible 
sequences. When YZ = AT or TA, C1G sequences were used. Otherwise, C1T sequences were used.) 
were ligated by keeping the cDNA sample in the following reaction solution at 16°C overnight. 

* Composition of the reaction solution 

    10x E. coli DNA ligase buffer                                   1 µl
    100 mM (NH4)2SO4                                                1 µl
    1 pmol/µl adaptor solution                                      1 µl
    cDNA fragments digested with a class-IIS restriction enzyme     1 µl
    E. coli DNA ligase                                              3 units
    Distilled water                                                 to make 10 µl



(When the sequence XYZ contained neither G nor C, 5 pmol/µl adaptor solution and 6 units of E. coli DNA ligase 
were used.) 


(5) Amplification by PCR 

(5-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use, streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
1x B&W buffer (10 mM Tris-Cl pH 7.5, 1 M NaCl, 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To the sample, 15 µl of 5 M NaCl and 5 µl of the paramagnetic beads were added, left stationary for 15 minutes and 
washed with 1x B&W buffer once. Then, 10 µl of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 µl of 0.1 M NaOH once, with 1x B&W buffer once and with distilled 
water twice. 






EP 0 735 144 A1 



(5-2) PCR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
96°C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94°C, 1 
minute at 50°C and 1 minute at 72°C was repeated 25 to 35 times. After an extension step was carried out at 72°C for 
20 minutes, the reaction solution was cooled to room temperature. 



* Compositions of the reaction solutions (per one sample) 

(i) Enzyme reaction solution 

    10x PCR buffer for Stoffel fragment     1 µl
    2 mM dNTP                               1 µl
    25 mM MgCl2                             1.2 µl
    Distilled water                         4.3 µl
    10 u/µl Stoffel fragment *1)            0.05 µl

(ii) Primer reaction solution 

    10 pmol/µl fluorescent-C1S primer       0.5 µl
    10 pmol/µl λgt10 forward primer         0.5 µl

*1) A portion of AmpliTaq DNA polymerase fragment (Perkin Elmer) 




The two kinds of primers having the following sequences are used in combination: 
5'-OH-GTACATATTGTCGTTAGAACGC-3' (C1S primer) 
5'-OH-GAGCAAGTTCAGCCTGGTTAAG-3' (λgt10 forward primer) 
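
How the two primers relate to the adaptors used earlier in this example can be verified by simple string comparison. This is a sketch under the assumption that the sequences are exactly as listed in this example; the helper name `is_prefix_of` is illustrative.

```python
C1T = "GTACATATTGTCGTTAGAACGCT"                # biotinylated strand, C1T adaptors
C1G = "GTACATATTGTCGTTAGAACGCG"                # biotinylated strand, C1G adaptors
C1S = "GTACATATTGTCGTTAGAACGC"                 # C1S primer
LAMBDA_GT10_FORWARD = "GAGCAAGTTCAGCCTGGTTAAG"  # forward primer
ECORI_ADAPTOR_LOWER = "GAGCAAGTTCAGCCTGGTTAAG"  # 5'-OH strand of the EcoRI adaptor

def is_prefix_of(primer, strand):
    """True if the primer sequence is identical to the 5' end of the strand."""
    return strand.startswith(primer)

# C1S lacks only the final base that distinguishes C1T from C1G.
c1s_covers_both = is_prefix_of(C1S, C1T) and is_prefix_of(C1S, C1G)
# The forward primer has the same sequence as the EcoRI adaptor's 5'-OH strand.
forward_matches_adaptor = LAMBDA_GT10_FORWARD == ECORI_ADAPTOR_LOWER
print(c1s_covers_both, forward_matches_adaptor)
```

Both checks hold, which suggests that a single C1S primer serves for either biotinylated adaptor and that the forward primer corresponds to the EcoRI adaptor end.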

(5-3) Preparation of Electrophoresis Samples 

A 3 µl sample was taken from the reaction products and 5 µl of T4 DPase solution having the following composition 
was added thereto. The resultant mixture was reacted at 37°C for 40 minutes. 



* Composition of T4 DPase solution (per one sample) 

    10x M buffer                     0.5 µl
    2 mM dNTP                        0.5 µl
    Distilled water                  4 µl
    T4 DNA polymerase (Toyobo)       1 unit






After ethanol precipitation of the reaction solution, 3.5 µl of a buffer (80% formaldehyde, 10 mM EDTA, 6 mg/ml blue 
dextran) was added to the sample (i.e., precipitate), heated at 95°C for 4 minutes, then immediately applied to the sample 
well of ABI 373A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of 30W for 13 
hours). 

Fig. 5 shows one example of the electrophoresis patterns obtained. 
Claims 


1. A method for molecular indexing comprising the following steps: 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction 
enzyme of class-IIS, 
(2) ligating each of the resultant cDNA fragments to one from a pool of 64 biotinylated adaptors cohesive to all 
possible overhangs, 

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzyme of class-IIS 
which are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA 
sample, 

(4) obtaining a second cDNA sample by repeating the above steps (1) to (3) wherein the second class-IIS 
restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are 
used for the subsequent digestion, 

(5) obtaining a third cDNA sample by repeating the above steps (1) to (3) wherein the third class-IIS restriction 
enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for 
the subsequent digestion, 

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and 
then removing from said samples the oligonucleotide complementary to an adaptor-primer to be used in (7), 

(7) amplifying each of the resultant cDNA samples by PCR using an adaptor-primer and one of anchored oligo- 
dT primers, 

(8) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 
of the fragments obtained. 

2. The method of claim 1, wherein the class-IIS restriction enzymes are used in the combination of Fok I, Bsm AI and 
Bsm FI. 

3. The method of claim 1, wherein the cDNA has been synthesized by reverse-transcribing a tissue- or cell-derived 
RNA using a mixture of the following oligonucleotides as primers: 

5'-OH-GGATCCT16A-3' 
5'-OH-CAGCTGT16C-3' 
5'-OH-CTCGAGT16G-3' 

4. A method for molecular indexing comprising the following steps: 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA, or DNA with a restriction 
enzyme of class-II, 

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the 
class-II restriction enzyme, 

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class-IIS, 

(4) ligating each of the resultant cDNA or DNA fragments to one from a pool of 64 biotinylated adaptors cohesive 
to all possible overhangs, 

(5) recovering the resultant ligation sample by using streptavidin-coated paramagnetic beads and then removing 
from said sample the oligonucleotides complementary to adaptor-primers to be used in (6), 

(6) amplifying the resultant cDNA or DNA sample by PCR using adaptor-primers, 

(7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 
of the fragments obtained. 

5. The method of claim 4, wherein the class-IIS restriction enzyme is any one of Fok I, Bsm AI, Bsm FI, Sfa NI or Bbv I. 
FIG. 1 
FIG. 2 

When the base adjacent to poly A on the 5' side is T: 

T(A)16GGATCC 
A(T)16CCTAGG 

When the base adjacent to poly A on the 5' side is G: 

G(A)16CAGCTG 
C(T)16GTCGAC 

When the base adjacent to poly A on the 5' side is C: 

C(A)16CTCGAG 
G(T)16GAGCTC 
FIG. 3 
FIG. 4 
Adaptor: NCCC 

Sample: Mouse liver cDNA (Fok I was used for the initial digestion.) 
FIG. 5 
Sample: Mouse liver cDNA (digested with EcoRI and Bsm AI) 